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Abstract 

What is the powerful ingredient which allows a dramatic speed-up of quantum com- 
putation over classical computation ? We propose that this ingredient is an implicit 
use of the Bayesian probability theory Furthermore, we argue that both classical and 
quantum computation are special cases of probability reasoning. On these grounds, intro- 
ducing Bayesian probability theory in classical computation as well, we reduce a typical 
NP problem, namely 3-SAT, to a linear programming problem. According to algorithmic 
complexity theory, this proves that P=NP. 

1 Introduction 

Computation of a deterministic function is fundamentally a conclusive process. Therefore, 
the insertion of randomness in algorithms should be regarded as counterproductive. By 
contrast, Leonid Levin, an outstanding former student of Kolmogorov claims on his home 
page [l| that randomness is really 'native' to Computer Science. In the present paper, 
we strongly support this claim and argue that this is basically the consequence of the 
profound identity between logic and probability as stated in Bayesian theory, even if this 
direction may not be exactly in line with the Kolmogorov views. To reach our goal, we 
deal at the intersection of several fields: Algorithmic computation, algorithmic complexity, 
quantum computation and of course Bayes probability. In each domain, we pick out a very 
few ingredients, often at the simplest elementary level. Together, these ingredients lead 
surprisingly to a straightforward but exceptional achievement: Any logical algorithm can 
be formulated as a linear programming problem. Specially, a basic question of logical sat- 
isfiability with n variables, namely 3-SAT, is equivalent to a linear programming problem 
with 0(n 3 ) unknowns and even in general with O(n) unknowns. According to algorithmic 
complexity theory, this proves that P=NP. 

This paper is organized as follows: We shortly skate over the foundation of the domains 
of interest through an historical sketch placing emphasis on the implicated ingredients. 
Next, we describe how to specify logical algorithms in terms of conditional probability. 
We finally apply this model to the 3-SAT problem before concluding. 



2 Hints on the different domains 
2.1 Foundation of algorithmic computation 

The concept of algorithm as a process of effective calculability was formulated indepen- 
dently by Alonzo Church [2| and Alan Turing Q in 1936. In the modern formulation, 
the so-called Church-Turing principle, states that any algorithm can be performed with 
a 'deterministic Turing machine'. 
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In the late 1950s, Michael Rabin [4] described a two tape automaton introducing the 
idea of nondeterministic Turing machines and suggesting the use of a new ingredient 
in computation, namely, randomness. In 1970, E. R. Berlekamp 0| described such an 
algorithm requiring a sequence of trials selected at random. Later, in 1977, Robert Solovay 
and Volker Strassen 0| found that the problem of integer primality, at this time out of 
reach of tractable deterministic algorithms, was settled by randomization. It should be 
noticed that the deep reason of this improbable efficiency remained basically unknown. 
Whatever that may be, the concept of Turing machine had to be extended to account 
for these results. Nowadays, the so-called strong Church-Turing principle postulates that 
any classical computation can be performed with 'a non deterministic Turing machine.' 
However, the insertion of randomness in classical algorithms consisted mainly in selecting 
input data at random. This remained pretty 'miserable' when compared with the next 
advance : quantum computation. 



2.2 Quantum computation 

Following ideas by Richard Feynman David Deutsch discovered quantum computation 
in 1985 [8]. This process makes use of formidable quantum resources, but remains definitely 
a probabilistic algorithm. Compared to classical algorithms, the function to be computed 
is prepared in form of a physical setup of n-registers or regions. A standard assignment is 
inserted as a joint probability distribution over the regions. Physically, the joint probability 
distribution is derived from a wave function living in a Hilbert space, but for what we are 
concerned, only the probability distribution is of interest. Each region, called a qubit, is the 
quantum analogue of a classical bit. The final measurement is a random trial where each 
qubit 'collapses' into a classical bit with a deterministic value, say or 1. Generally, the 
qubit distributions are not independent and the regions are known as entangled. Otherwise, 
the qubit are called separable. A quantum algorithms is a physical setup, composed of 
suitable quantum gates and convenient settings which mimic logical functions [9]. In 
practice, quantum algorithms are described in form of circuits, where wires stand for 
qubits and gates are selected among a tool kit (l"o| . 

In some cases, this method was proven to extend the class of tractable problems and 
provided a substantial speed-up over classical algorithms (ill ). Specially, Peter Shor fl~2| 
found in 1994 an efficient quantum process to factorize integers in polynomial time, while 
such an algorithm is still unknown in classical computation. According to Deutsch, 'the 
theory of computation' became 'the quantum theory of computation', even if we have to 
recognize that efficient quantum computers are not likely to be available in the foreseeable 
future. 

Again, the main reasons of the supposedly quantum efficiency are basically unknown, 
but the common wisdom is that entanglement should be the key ingredient. Indeed, in the 
quantum community, it is 'widely believed that classical systems cannot simulate highly 
entangled quantum systems' [13j ). By contrast, we have nevertheless argued 14| that 
the concept of entanglement is actually a quite classical attribute of contextual systems. 
Furthermore, any classical computer can be regarded as a highly entangled system as far 
as the entanglement is measured between the binary digits of the processed data during 
the computation. In addition, we shall argue that the crucial ingredient is not in the least 
a Hilbert space with its full quantum machinery but only a flexible randomization, or to 
be exact, the implicit use of Bayes probabilities. This is like a grin without a cat in Alice 
and Bob adventures in Quantum Land. 



2.3 Bayes probability 



In the probability domain, at least two conceptions have been widely debated |15| . Accord- 
ing to the 'Frequentists', probability should be regarded as a real property of an object. 
By contrast, for the 'Bayesians', probability characterizes not the object but the level of 
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knowledge of the observer on the object. The Bayesian conception was implicitly admitted 
by Bernouilli, Laplace or Poincare. But at the beginning of the 20th century, the Frequen- 
tist view became more and more popular, possibly in accordance with the 'Copenhagen 
interpretation' of quantum mechanics. This interpretation states that quantum random- 
ness is absolute, 'ontological', in opposition with Einstein views, convinced that 'God does 



not play dice'. But after World War II, the works of George Polya [16], Richard Cox 17] 



Claude Shannon fl8| and Edwin Jaynes 15 1 gave new supports to the Bayesian vision. 
Polya demonstrated the strong connection between 'probability', 'plausibility', 'logic' and 
'common sense'. Cox derived the actual mathematical structure of probability theory in 
accordance with both Polya vision and Boolean operations. Eventually, Jaynes synthesised 
these results and conceived the maximum entropy principle, accounting for Shannon in- 
formation theory. This gives its overall consistency to the Bayesian conception (which, in 
our opinion, should be rather called 'Jaynesian conception'). Now, as already formulated 
by Laplace, this theory is explicitly thought as an extension of Aristotle logic to situa- 
tions where the knowledge of the observer is incomplete. Therefore, the mathematical 
tool for logic, namely Boole algebra, has to be extended to the algebra of events, namely 
a Kolmogorov probability space. In turn, we plead in the present paper that algorithmics, 
whether classical or quantum, has to be regarded as a special case of probability reasoning. 
But what about complexity? 

2.4 Algorithmic complexity 

It is well-known that logical algorithms may never terminate. This was the origin of 
the works of Turing 0] in 1936. Among the algorithms that halt, J. von Neumann [19] 
noticed in 1953, in the context of game theory, that some algorithms only terminate 
after an exponential time. In the sixties, appeared the need to establish a hierarchy to 
characterize the computation efficiency. The issue was first addressed by Rabin (2o| in 1960 



and precisely formulated by Alan Cobham [2l| . Juris Hartmanis & Richard Stearns [2 



and Manual Blum [23fl in 1964-1967. A basic class of complexity is the so-called class P 
of problems 'solvable by a deterministic algorithm within a number of steps bounded by 



some fixed polynomial in the length of the input' 24|. A typical class P problem is to 
check whether an explicit Boolean function is or not satisfied by a given assignment. Other 
important P problems are elementary arithmetic operations, solvability of linear systems, 



and noticeably, solvability of linear programming 25] (even if the most popular algorithm, 
the Dantzig simplex method (26| . may be exponential in the worst case). This class is 
considered as the set of tractable problems. 

By contrast, class NP, originally defined as the 'set of problems recognizable in polyno- 
mial time by a nondeterministic Turing machine', consists of problems for which a given 
solution can be checked in polynomial time. The classification of NP-problems is domi- 
nated by the discovery of NP-completeness. Informally, it is a subclass of NP such that any 
NP-algorithm can be reduced in polynomial time to an algorithm of this subclass. This 
concept was discovered by Stephen Cook in a seminal paper in 1971 (2tJ and independently 
by Leonid Levin (28| in 1973. Therefore, if one NP-complete algorithm is in P, then all 
NP-algorithms are in P which implies that P = NP. In 1972, Richard Karp (29| reviewed 
the mains algorithms in the different classes. Among NP-complete algorithms, we find 
satisfiability (or SAT) of a conjunction of clauses, [27J (an specially 3-SAT with exactly 
three literals per clause), 0-1 integer programming, or in graph theory, determination of 
cliques, set packing, Hamilton circuits, etc. 

Now, an appealing bridge between P and NP algorithms is randomness. This is the 
object of the present paper. 
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3 Bayesian formulation of algorithms 



For simplicity, we consider the Boolean algebra of n dichotomic variables Xj where i G [1, n] 
with two possible assignments Xj and Xj. Throughout this paper, depending upon the 
context, we will also use 1, TRUE or (i) for Xj ; and 0, FALSE or (— i) for Xj. As in 
information theory [3(3], given decision functions f, fi, f2, it will be convenient to note f 
the complement of f, (fi;f2) the conjunction of fi and \<i and (fi,f2) the disjunction of fi 
and f2. We will name requirement any literal (i.e. Xj or Xj) or conjunction of literals like 
(Xjj Xj] . . . ; Xfc), complete requirement a requirement of n literals like S = (Xi; X2; . . . ; X n ) 
and partial requirement a requirement with less than n literals. Similarly, when needed, 
we will use the terms of partial assignments, e.g. (i; —j; . . . ; k), and complete assignments, 
e.g. (l;-2;...;n). 

In algorithmics, a literal is considered as a deterministic function and its assignment 
as a deterministic constant. 

By contrast, we propose to regard literals Xj (or Xj as well) as random variables, with 
two possible outcomes and 1. Therefore requirements and more generally decision func- 
tions are ipso facto random variables, while assignments are still deterministic constants. 

Now, an event is a decision function and the outcomes are the possible assignments, 
e.g., Xj = Xj or f = 1. The basic sample set of probability is the ensemble f2 = {H} of 
cardinality 2 n of all complete requirements 5, labeled by all complete assignments £ = 
(±1; ±2; . . . ; ±n). Let T be the sigma-algebra over SI and P a probability distribution 
on T . The Kolmogorov probability space is now (J7,T, P). Clearly, the sigma-algebra 
T of cardinality 2 2 " can be identified with the set of all classical decision functions of n 
variables. 

In Bayesian theory, the probability distribution, P, is conditional and depends upon 
prior information. Since we are primary concerned by classical decision functions f £ T, 
we will compute probability distributions conditioned by the prior f = TRUE. Very occa- 
sionally, we will also consider wider conditions defined by specific equations not necessarily 
related to a classical decision function, for instance quantum conditions. 

A decision function f € T is satisfiable if and only if f 7^ 0. In terms of probability 
distribution, this means that the probabilities P(£|f) of all 2 n complete events of 0, given 
that f is TRUE, sum to 1. By contrast, if f = 0, no conditional probability distribution 
P(£|f) can exist. 

For i,j,--- € [1, n] we consider partial requirements given that f is TRUE, e.g, (Xj[f), 
(Xi]Xj\f), (Xi;X 3 -;Xfc|f), (Xj; X 2 ; . . . ; X n |f). For simplicity, we have noted e.g., (Xj;Xj|f) 
for ((Xj; Xj)|f), omitting some parenthesis. We will call partial probabilities the probabili- 
ties of partial requirements. Recall that we note P(Xj = Xj|f) (and P(Xj = Xj|f) as well) as 
P(i|f), P(Xj = Xj|f) (and P(Xj = Xj|f)) as P(— i|f) and the probability of complete events 
P(3 = £|f) as P(£|f). Again we note P(i; j\f) for P((i; j)\f). 

Irrespective of f 7^ 0, by probability rules, partial probabilities conditioned by f = 
TRUE, obey the following universal relations: 



P(±i;±j;±fc;...|f)>0 (1) 

l = P(i|f)+P(-i[f) (2) 

P(t|f)=P(*;j|f)+P(t;-i|f) (3) 

P(»;j|f)=P(i;j;fc|f)+P(i;j;-fc|f) (4) 



etc., where k, . . . are signed integers and \i\, \k\, ■ ■ ■ € [1, n\ are distinct. It is easy 
to establish that we have m = n distinct equations like Eq.([2]), 4Q) distinct equations 
like Eq.Q, 12(g) distinct equations like Eq.Q, etc. Note that accounting for Eqs.([2j[3j 
IU etc.), Eq.([T]) implies that 

P(±i;±j;±fc;...|f)<l (5) 
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and 



P(i[f) = => P(i;j|f) = P(*;j;fc|f) = 0; (6) 
P(*;j;fc|f) = 1 => P(*;j|f) = 1 => P(*|f) = 1. (7) 

While the linear equations Eqs.([TJ [21 O 01 etc.) express the rules of logic [17] and 
are therefore universal, a particular decision function f can be described by a number of 
specific linear equations between partial probabilities. For instance, in the next section, 
we will construct the specific linear equations of a 3-SAT problem (cf. Eq.© below). 
The logical consistency of the specific equations is precisely checked by reference to the 
universal equations. The system composed of the relevant universal relations and the spe- 



cific equations define a classical linear programming problem [31|, where the unknowns are 
the relevant partial probabilities. When feasible, each solution is a consistent probability 
distribution P over T. The linear programming problem is characterized by a convex 
polytope of solutions in the linear vector space spanned by the ensemble of all relevant 
partial probabilities. We will refer to this object as the f -polytope. It is known that such 



a convex polytope is described by its extreme points (or vertices) [321 ]. A trivial case is 
the tautological function U, with no specific equation. We will name its polytope the 
U-polytope. 

Separable events: Let e.g. Ho ==' (Xi; X2; ...;X n ) be a complete requirement and 

£0 (%i]%2i ■ ■ ■ j x n) the corresponding complete assignment, so that So = £0 with 
certainty. We have by definition P(xi;x2', ■ ■ ■ , x n |Eo) = 1 and any partial probability like 
P(xi; Xj] Xfc|Ho) is or 1. These conditional events are also independent or 1 separable' 
(using quantum formulation). That is, e.g., 

P(z; j; -fc|Eo) = P(*|S ) • P(i|H ) • P(-k\E ) 

In words, given a complete requirement, conditional partial probabilities are deterministic 
and separable. More generally, from the universal relations Eqs.JIJ [21 El SI etc.), a deter- 
ministic distribution, with all partial probabilities equal to or 1, is separable, as stated 
by the following lemma: 

Lemma 1. Given a decision function f 7^ 0, all deterministic solutions of the linear 
programming system are separable. 

Proof: By hypothesis, all partial probabilities are or 1. We proceed by inspection. 
For example, for partial probabilities of three and less than three literals, let i, j, k be 
signed integers with distinct \k\ £ [l,n]]. Consider e.g., the unknown P (i; j ; k\f) . 

If at least one relevant unknown like P(i |f ) is zero then from Eq.© P(i;j\f) = 0, i.e., 
P(i;j|f) = P(*|f) • PO'IO and P(i;j;k\f) = 0, i.e., P(i;j;jfe|f) = P(i|f) • P(j|f) • P(fc[f). Now, 
if P(i|f) = 1, P(j|f) = 1 and P(fc|f) = 1 then from Eq.© P(-j|f) = 0, and from Eq.© 
P(i--j\f) = and P(i;j;-jfe|f) = 0. Therefore, from Eq.flS}, P(i;j|f) = P(i[f) = 1, i.e., 
P(f;j|f) = P(i|f)-P(i|f). Finally, from Eq.®, P(i;j;k\f) = P(i;j|f) = 1, i.e., P(i;j;fe|f) = 
P(i|f) • P(i|f) • P(fc|f ). We proceed similarly for more than three literals. The proof is 
completed. 



Entangled events: Now, let Hi and H2 be two distinct complete requirements. We con- 
sider the disjunction f = (Hi,^). Let £1,^2 be the two complete assignments satisfying 
Hi and H2 respectively. Thus, given f, the probability P(£|f) of any complete event is ex- 
cept for P(£i|f) and P(^2 [f )• Therefore, f is described by the Bayesian linear programming 
system, limited to the complete probabilities P(£|f), 

P(£i|f) + P(6l0 = i 
v^{6,6}: P(e|f) = o 

subject to : P(£|f) > 0. 
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The f-polytope is reduced to a line segment in the plane (P(£i |f), P(^2 |f ))• The extreme 
points of coordinates (1, 0) and (0, 1) correspond to the two events (£i|f) and (^lO- O n ^ ne 
other hand, we may consider combinations of £i and £2 within the edge [P(£i|f, P^IOl- 
Therefore, a number of non deterministic distributions are also solutions. For instance, 
in the middle of the edge, the two mutually exclusive events, H = £1 and S = £2 may 
be viewed as equally likely. In this case, partial probabilities, computed by enumerative 
combinatorics or simply by the probability product rule, read e.g. for three literals, 



If the partial assignments (i), (j) and (k) are different in £1 and £21 we will describe 
events like (i\f), (i;j;k\f) etc., as 'entangled', (using quantum formulation). This 

result can be extended to any logical combination of requirements. In summary, non 
deterministic partial probabilities are generally entangled. 

Classical bit: Specially, consider a system with just a single variable. A one bit tau- 
tology f = U is a satisfiable decision function described solely by the universal conditions 
Eqs.(Q] [2]), with two possible assignment (1) and (—1) and no specific equation. The linear 
programming problem reads 



with two deterministic exclusive solutions, namely TRUE and FALSE, or (1) and (— 1). 
The U-polytope of the classical bit is reduced to a one dimensional segment between the 
two points (0,1) and (1,0) in the plane (P(-1|U), P(1|U)). 

Still with one variable, the specific equation, P(l|f) = defines the function f = Xi 
while P(— l|f) = describes the function f = Xi. The f-polytopes are both reduced to one 
isolated vertex. Theses functions are obviously 'classical' in the sense that they accept 
deterministic solutions. More generally, we will name 'classical' a linear programming 
problem that accepts deterministic solutions. 

Quantum bit: Linear programming is fully compatible with quantum information as 
well. However, a strictly quantum solution implicitly defined by a linear programming 
system cannot be deterministic, because before collapse, 'in the quantum world maximal 
information is not complete and cannot be completed' (quoting Caves et al (33|). This has 
to be stipulated by a specific equation. For instance, a qubit Q is defined by the system 



In the classical world, this describes as well a one-bit system with incomplete informa- 
tion, e.g., a supposedly fair coin, given that it has not been tossed. The linear program- 
ming problem is still feasible but only accepts a single non- deterministic solution. The 
Q-polytope of the qubit is reduced to the isolated vertex, P(— 1|Q) = P(1|Q) = 1/2. 
Therefore, no deterministic function is defined nor of course satisfiable. In other words, P, 
conditioned by Q is a solution of the linear programming problem but Q £ T: Q is only 
defined by its specific equation. 

Beyond the qubit, in quantum information or in classical systems with incomplete 
information, feasible solutions P of a linear programming problem are probability distri- 
butions conditioned by specific equations that does not necessarily refer to an element of 
T. In a way, the set of specific equations defines an extension Q of the set of decision 
functions (but not of the sigma-algebra which remains unchanged) and for instance Q £ Q. 
To sum up, a problem with incomplete information is similar to a deterministic problem 
except that no point of its polytope is deterministic. 




universal equation : P(1|U) + P(— 1|U) = 1 

subject to P(1|U) > 0, P(-1|U) > 0. 



universal equation : 
specific equation : 
subject to 



P(1|Q)+P(-1[Q) = 1 
P(1|Q) = P(-1|Q) 
P(1|Q) >0, P(-1|Q) >0. 
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However, we have to ask whether a decision function f defined by seemingly deter- 
ministic equations might eventually lead to a strictly quantum situation. The answer is 
negative: 

Lemma 2. Given a feasible linear programming problem, if all specific equations are de- 
rived from classical Boolean equations, then the problem accepts a deterministic solution. 

Proof: Since the linear programming problem is feasible, the system accepts a solution 
and a probability distribution P is defined on T ■ Specially, a probability distribution P(£|f) 
is defined on the sample set Q. On the other hand, since f is a classical Boolean function, 
it is possible to check whether f(£) is TRUE or FALSE for all complete assignments £ € O. 
If f(£) = FALSE for all £, then f = and no probability distribution is defined on fi: 
This contradicts the hypothesis that the problem is feasible. Therefore, the system accept 
a deterministic solution. 

All preceding considerations are valid for any Boolean function. Now, we are going to 
describe an example of Bayesian process to compute the solutions of an explicit decision 
function. For the purpose in hand, we will solve the 3-SAT problem. 

4 Polynomial time resolution of the 3-SAT problem 

The 3-SAT problem is to determine whether a function f, defined as a conjunction of m 
disjunctions (or clauses) with exactly three literals, is or not satisfiable. For instance, a 
particular clause C r may be 

C r ( = ) (Xj,Xj,Xfc) 

where i,j,k G [l,Ji] are distinct, r € [1, m] and of course < m < 8(3), (recall that wo 
note (fi,f2) the disjunction of two decision functions fx and f2). Therefore, we have by 
definition 

P(c r |C r ) = P(c r |f) = 1 

where (in this example) c r ( = f) (i, —j, k). It is convenient to use rather the complement of 
c r to obtain an equation in terms of partial probability, 

P(-i; j; -k\Cr) = P(-i; j; -k\f) = 0, (8) 

(recall that we note (i; j) the conjunction of two assignments i and j). In words, given 
that C r is TRUE, then the probability of the complement of C r to be TRUE is zero. The 
3-SAT problem is completely defined by m specific equations like Eq.^B\j. 

The consistency of the specific equations will be checked by reference to the relevant 
universal equations. Since the specific equations only involve requirements with three 
literals, requirements with more than three literals are irrelevant. Therefore, among the 
universal equations, only Eqs.([2]H|) may be relevant. Now, the 3-SAT problem is described 
by the linear system Eqs.flSHIB]) in R of (^)+4(™)+12(™)+m equations for 2(™)+4(™)+8Q) 
unknowns subject to the constraints 

P(±i|f) > 0; P(±i; ±i|f) > 0; P(±t; ±j; ±k\f) > (9) 

where i,j, k € are distinct. This defines a classical problem of linear programming 

with 0(n 3 ) unknowns. 

In fact, the linear system can be strongly simplified. For instance, for zero clause, the 
system is reduced to the universal problem and a set of zero equation is quite sufficient! 
Generally, for non trivial problems the number of clauses is m = O(n) because other- 
wise the satisfiability is easy to determine. Whence, only a limited number of unknowns 
involving three or less literals are relevant, depending upon the particular problem. For 
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example, the single clause defined by Eq.([8]) involves 3x2 relevant unknowns with one lit- 
eral, namely, P(±i|C r ), P(±j|C r ) and P(±£;|C r ), 3x4 relevant unknowns with two literals 
like P(±i; ±j|C r )) and 8 relevant unknowns with three literals, namely, P(=bi; ±j; ±fc|C r ), 
i.e., a set of 26 unknowns per clause, and thus, when accounting for overlapping, a total of 
about 20m unknowns is relevant to the problem. When m = O(n), the number of relevant 
unknowns is rather O(n) than 0(n 3 ). 

Definition. Given a 3-SAT problem, we will name auxiliary Bayes system the linear 
system defined by Eqs. ^E^W subject to constraints given by Eq. ^ and using only the 
relevant unknowns. 

If the 3-SAT problem is satisfiable, the linear programming problem accepts a deter- 
ministic separable solution and is ipso facto feasible. 

Conversely suppose that the linear programming problem, Eqs. (TT H4|8p . is feasible. 
Since the specific equations are derived from Boolean relations, Lemma [2] holds and the 
system accepts a deterministic solution. From Lemma [TJ this solution is separable. There- 
fore, we can compute P(i|f) = or 1 for |z| £ [l,n] and P(£|f) = 1 for some £ € £1. 
It is possible that an unknown with one literal like P(±i|f) is not relevant, and thus the 
problem does not depend on this particular variable and the two partial assignments (±i) 
are both valid. Eventually, the 3-SAT problem is satisfiable. 

Theorem. A 3-SAT problem is satisfiable if, and only if, its auxiliary Bayes system is 
feasible. 

Since the maximum value of m is 8(3) = 0(n 3 ), given that linear programming is in 



class P 25| | , the feasibility of the auxiliary Bayes system is obtained in polynomial time. 



Given finally that 3-SAT is NP- complete [27], we have the following result, 
Corollary. P=NP. 



Implementation The present Bayesian method stands basically as an enumeration 
of the relevant requirements. This appears rather disproportionate and even tedious for 
trivial issues. For example, with just a single clause and three variables, we have seen that 
we have 6 relevant unknowns with 1 literal, 12 unknowns with 2 literals and 8 unknowns 
with 3 literals, i.e., a total of 26 relevant unknowns. In addition, we have 1 specific equation 
like Eq.([8|), 3 equations like Eq.Q, 12 equations like Eq.([3]), and 12 equations like Eq.(|U), 
i.e. a total of 28 equations for 26 unknowns, just to solve a trivial problem! But on the 
other hand, the number of relevant parameters (unknowns and equations) scales as O(m) 
with generally m <C 0(n 3 ), therefore the complexity remains polynomial for real problems 
and the method becomes very powerful compared to exponential algorithms. 

When the problem is feasible, the initial step of the simplex method provides a feasible 
base but, in general, does not give a deterministic solution. Such a deterministic separable 
solution is obtained by optimazing a maximum of 2n objective functions. For example, 
we may first select the two objective functions z\ = P(±l|f). If the two maxima are both 
different from 1, then no deterministic solution is feasible which contradicts Lemma [2J 
Therefore, we have a maximum z\ = 1 for say z\ = P(— l|f). Now we might assign the 
value Xi = FALSE and the new problem with n — 1 variables would still be satisfiable. 
Instead, we select two objective functions Z2 = P(— l|f) +P(±2|f). Now, if the two maxima 
are both different from 2, then again this contradicts Lemma [2j Therefore the maximum 2 
is obtained e.g., for z 2 = P(-l|f) + P(2|f), we iterate with z 3 = P(-l|f)+P(2|f) + P(±3|f), 
etc. Finally, we select an optimum solution with P(±i|f) = 1 for i 6 [l,ra]]. 



Examples Let us give a few examples. In practice, using Eqs.(j2]|4]), it is convenient to 
first eliminate partial probabilities with one or two literals and only deal with the relevant 
partial probabilities of three literals. This preliminary elimination will be done in the 
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following examples. The first example, with a single clause, illustrates the fact that the 
method is basically an enumeration of the relevant partial probabilities. The seven ver- 
tices are deterministic while other solutions are entangled. The next case with two clauses 
illustrates how the universal equations Eqs. (|3|4p behave like consistency conditions. The 
following example with 7 variables shows that an unsatisfiable problem leads to an un- 
feasible system, while the linear system without constraints has infinitely many solutions. 
The next case with 10 variables illustrates the fact that vertex feasible solutions are not 
in general separable and a linear optimization is necessary to obtain the deterministic 
solutions. The last problem with 87 variables is the only non trivial example and the first 
sample where the present method can compete with conventional methods. 

One clause: For a single clause, only n = 3 variables are relevant. In this case, after 
elimination of unknowns of 1 and 2 literals, the system is reduced to an enumeration of 
the 8 partial probabilities of three literals. Let e.g., f ( = f) (X3,X2,Xi). We have one 
normalization equation and one specific equation. For simplicity, we will omit throughout 
the conditional symbol, e.g. P(3;2;l) for P(3;2;l|f). The linear programming problem 
reads, in simplex formulation, 

P(3; -2; -1) = 1 - P(-3; 2; -1) - P(3; 2; -1) - P(-3; -2; 1) 
-P(3;-2;1)-P(-3;2;1)-P(3;2;l) 

P(-3;-2;-l) = 
subject to P(±l; ±2; ±3) > 

The system is feasible. This formulation displays one particular deterministic vertex 
solution: the two basic unknowns are P(3; — 2;— 1) = 1 and P(— 3;— 2;— 1) = 0. Other 
unknowns are non basic and set to zero. The satisfying complete assignment £ is obtained 
using Eq.([7| and Eq.([2]), i.e., for the displayed vertex, £ = (3;— 2;— 1). The 6 other 
vertex solutions are also deterministic and obtained by straightforward pivoting from this 
configuration. 

Two clauses: Let n = 4 variables and e.g., f = [(X 3 , X 2 , Xi); (X 4 , X 3 , X 2 )]. The full 
problem has 44 unknowns and 50 equations. After elimination of the partial probabilities 
with less than 3 literals, we have 16 relevant unknowns and 12 equations, 

2 universal normalization equations, derived from Eq.([2]) namely 
P(-3; -2; -1) + P(3; -2; -1) + P(-3; 2; -1) + P(3; 2; -1) + 

P(-3; -2; 1) + P(3; -2; 1) + P(-3; 2; 1) + P(3; 2; 1) = 1 
P(-4; -3; -2) + P(4; -3; -2) + P(-4; 3; -2) + P(4; 3; -2) + 

P(-4; -3; 2) + P(4; -3; 2) + P(-4; 3; 2) + P(4; 3; 2) = 1 
4 universal consistency equations, derived from Eq.Q namely 
P(-3; -2; -1) + P(3; -2; -1) + P(-3; -2; 1) + P(3; -2; 1) 

= P(-4; -3; -2) + P(4; -3; -2) + P(-4; 3; -2) + P(4; 3; -2) 
P(-3; 2; -1) + P(3; 2; -1) + P(-3; 2; 1) + P(3; 2; 1) 

= P(-4; -3; 2) + P(4; -3; 2) + P(-4; 3; 2) + P(4; 3; 2) 
P(-3; -2; -1) + P(-3; 2; -1) + P(-3; -2; 1) + P(-3; 2; 1) 

= P(-4; -3; -2) + P(4; -3; -2) + P(-4; -3; 2) + P(4; -3; 2) 
P(3; -2; -1) + P(3; 2; -1) + P(3; -2; 1) + P(3; 2; 1) 

= P(-4; 3; -2) + P(4; 3; -2) + P(-4; 3; 2) + P(4; 3; 2) 
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4 universal consistency equations, derived from Eq.Q namely 

P(-3; -2; -1) + P(-3; -2; 1) = P(-4; -3; -2) + P(4; -3; -2) 
P(-3; 2; -1) + P(-3; 2; 1) = P(-4; -3; 2) + P(4; -3; 2) 
P(3; -2; -1) + P(3; -2; 1) = P(-4; 3; -2) + P(4; 3; -2) 
P(3; 2; -1) + P(3; 2; 1) = P(-4; 3; 2) + P(4; 3; 2) 

and 2 specific equations, namely 
P(-3;2;-l)=0 
P(4;-3;-2) =0 



The rank of the system is 7 and the linear programming problem reads, in simplex 
formulation, 

P(-3; -2; -1) = 1 - P(-3; -2; 1) - P(-4; 3; -2) - P(4; 3; -2) 

- P(-4; -3; 2) - P(4; -3; 2) - P(-4; 3; 2) - P(4; 3; 2) 
P(-4; -3; -2) = 1 - P(-4; 3; -2) - P(4; 3; -2) - P(-4; -3; 2) 

-P(4;-3;2)-P(-4;3;2)-P(4;3;2) 
P(3; 2; -1) = - P(3; 2; 1) + P(-4; 3; 2) + P(4; 3; 2) 
P(3; -2; 1) = - P(3; -2; -1) + P(-4; 3; -2) + P(4; 3; -2) 
P(-3; 2; 1) = + P(-4; -3; 2) + P(4; -3; 2) 
P(-3;2;-l) =0 
P(4;-3;-2) =0 



We set to zero the non-basic unknowns: The only non zero partial probabilities are 
P(— 3; —2; —1) = 1 and P(— 4; —3; —2) = 1 leading to the satisfying complete assignment 
£ = (—4; —3; —2; —1), obtained from Eqs.jH [2]) for the displayed vertex. Other solutions 
are obtained by pivoting. 

Example of non satisfiable problem, 27 clauses, n = 7: 

_ Let f_^ =) [(X3 ,_X2 , Xi ) ; (X 3 ,2<2,Xi); (X3,X.2,Xi); (X4,X 3 ,X2); (X 4 ,X2,Xi); (Xs,X2,Xi 
(X 5 ,X 4 ,X 2 ); (X 5 ,X4,X 3 ); (X 5 ,X 3 ,X 2 ); (X 5 ,X 4 ,X 3 ); (X 5 ,X4,X 3 ); (X 6 ,X 4 ,Xi); (X 6 ,X 5 ,X 4/ 
(X6,X 5 ,X 2 ); (X 6 ,X 5 ,X 3 ); (X 6 ,X 5 ,Xi); (X 6 ,X 5 ,Xi); (X 6 ,X 5 ,X 2 ); (X 7 ,X 2 ,Xi); (X 7 ,X 3 ,X 2 
(X 7 ,X 4 ,Xi); (X 7 ,X 4 ,X 2 ); (X 7 ,X 5 ,X 3 ); (X 7 ,X 3 ,X 2 ); (X 7 ,X 4 ,X 3 ); (X 7 ,X 5 ,X 4 ); (X 7 ,X 6 ,X4)] 
We have 160 relevant 3-literal unknowns, 20 normalization equations, 126 + 156 con- 
sistency equations and 27 specific equations, i.e., a total of 309 equations. The rank of the 
linear system is 139. When accounted for the constraints the linear programming problem 
is not feasible. Therefore, the 3-SAT system is not satisfiable. 

Example of satisfiable problem, 39 clauses, n = 10: 

Let f_= [(X4,X 3 ,X 2 ); (X4,X 3 ,Xi); (Xs,X 2 ,Xi); ()<5,X 2 ,Xi); (X5,X 3 ,X 2 ); (X5,)<4,X 2 ) 
(X 5 ,X 4 ,X 2 ); (X 6 ,X 3 ,Xi); (X 6 ,X 5 ,X 3 ); (X 6 ,X5,X 4 ); (X 7 ,X 5 ,X 2 ); (X 7 ,X 6 ,X 4 ); (X 8 ,X 3 ,X 2 ) 
(X 8 ,X 3 ,X 2 ); (X 8 ,X 4 ,X 3 ); (X 8 ,X5,X 4 ); (X 8 ,X 5 ,Xi); (X 8 ,X 6 ,Xi); (X 8 ,X 6 ,X 2 ); (X 8 ,X 4 ,Xi) 
(X 8 ,X 7 ,Xi); (X 8 ,X 7 ,X 3 ); (Xg,X 5 ,X4); (X 9 ^X 6 , X4); ^X 9 , X 7 , X 6 ); (X 9 ,X 7 ,X 4 ); (X^X^Xs) 
(X9,Xs,X 2 ); (XgjX^Xi); (Xg, X 6 , Xi)jJX 9 , X 8 , X 5 ); (Xi ,X 2 ,Xi); (Xi ,X 3 ,X 2 ); (Xi ,X 6 ,X 3 ); 
(Xio,X 7 ,Xi); (Xio,X 7 ,X 5 ); (Xi ,X 8 ,X 5 ); (Xi ,X 8 ,X 5 ); (Xio,Xg,X 5 )] 

We have 35 normalization equations, 190 + 244 consistency equations and 39 specific 
equations, i.e., a total 469 equations for 280 relevant 3-literal unknowns. The rank of the 
linear system is 230. The linear programming problem is feasible and accepts a single 
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deterministic solution. In our particular implementation, we find first a highly entan- 
gled basic solution. Therefore, we have to use a maximization process to compute the 
deterministic solution. 

Example of 232 clause problem with n = 87: This is the last and only non trivial 
example, obtained from the competition SAT 2001 Conference on Satisfiability testing 
and referred to as 'a permuted 3-SAT formula with seed=1442724072'. The particular 
'permuted' structure leads to a surprisingly simple linear system with only 464 relevant 3- 
literal unknowns for 464 equations, namely, 58 normalization equations, 174+0 consistency 
equations and 232 specific equations. The rank is 377 and the problem is feasible. 

We have computed the preceding instances with home-made codes in an old fashion 
language just to corroborate the principle. To obtain challenging results, we need to run 
an efficient linear programming package for sparse systems. Clearly, the scaling efficiency 
of the method depends solely upon this package. 

5 Conclusion 

Taking advantage of the close proximity between logic and Bayesian probability, we have 
shown that a particular NP-complete problem, namely 3-SAT, can be efficiently solved 
by linear programming, i.e., in polynomial time. Therefore, a long-standing problem 
of algorithmic complexity is settled and P=NP. This result is of course of outstanding 
importance in both practical and theoretical domains. 

Practically, the consequences are dramatic: No one-way function can exist and thus 
no public-key encryption system can be safe. It is urgently needed to revisit all current 
systems. This includes internet communication, smart cards, banking operations, digital 
signature, etc. Only symmetrical encryption with one-time pad may be safe but key 
transportation has to be defended from eavesdropping. 

Theoretically, the consequences are in no way less important. The P versus NP prob- 
lem is one of the seven Millennium Prize Problems of the Clay Mathematics Institute 
(CMI) [34 ] . According to Cook [27]] the present result should have 'stunning consequences 
and may well include all of the CMI prize problems'... 

In line with the present paper, we will just comment two issues. First, we have brought 
into play a new method of computation at an elementary level with a result surprisingly 
powerful. To put it in a nutshell, this method is basically an enumeration of the relevant 
partial probabilities subject to specific conditions. The consistency of these inputs is 
checked by linear programming with reference to universal probability laws, i.e., to Boolean 
logic. In NP-problems, the overall input size is bounded by a polynomial in the number 
of variables. Given that linear programming is of class P, the consistency is checked in 
polynomial time. The formal difference with usual algorithms is that we deal with partial 
probabilities that are set down as the very unknowns of the problem. In a way, this may 
be compared to the introduction of an unknown l x' to address 'arithmetic problem' in 
elementary curriculum, leading to a different conception landscape. We propose a similar 
paradigm to address Boolean problems. This should open up a promising domain beside 
conventional deterministic and probabilistic algorithms, utilizing the complete tool kit of 
both Bayesian (l5| and information [30] theories. Eventually, this also suggests a close 
connection between algorithmic complexity and Kolmogorov complexity. 

The second issue concerns quantum information. In fact, we started this work for 
understanding the exponential speed-up of quantum computation, given that we regard 
entanglement as a strictly classical attribute of contextual systems Q. Therefore, we 
suspected that entanglement was not the real booster, except if P=NP. Even highly im- 
probable, this second hypothesis turns out to be right. Actually, in Bayesian formulation, 
a form of entanglement is at work in both classical and quantum computation, while the 
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use of strictly quantum description does not seem to provide any decisive benefit. This 
gives new insights on the possible foundations of quantum theory, supporting the Bayesian 
approach proposed by Caves et al (33| and suggests that quantum mechanical structure 
could be basically a powerful mathematical tool describing something like an 'algebra of 
logical observables'. 



References 

[1] Leonid Levin, Home page, 



URL http : // www . cs . bu . edu/~lnd/ 



[2] A. Church, An unsolvable problem of elementary number theory, American Journal 
of Mathematics 58 (1) (1936) 345-353. 

[3] A. Turing, On computable numbers, with an application to the Entscheidungsproblem, 
Proc. of the London Mathematical Society 42 (1936) 230-265. 

[4] M. O. Rabin, D. Scott, Finite automata and their decision problems, IBM Journal of 
Research and Development 3 (2) (1959) 114. 

[5] E. R. Berlekamp, Factoring polynomials over large finite fields, Math. Comp. 24 (1970) 
713-735. 

[6] R. Solovay, V. Strassen, A fast Monte-Carlo test for primality, SIAM J. Comput. 6 (1) 
(1977) 84-85. 

[7] R. Feynman, Simulating physics with computers, International Journal of Theoretical 
Physics 21 (6/7) (1982) 467-488. 

D. Deutsch, Quantum theory, the Church-Turing principle and the universal quantum 
computer, Proc. R. Soc, London A 400 (1985) 97-117. 

[9] T. D. Ladd, F. Jelezko, R. Laflamme, Y. Nakamura, C. Monroe, J. L. O'Brien, 
Quantum computers, Nature 464 (2010) 45-53. llarXiv: 1009.22671 



[10] J. Watrous, |An introduction to quantum information and quantum circuits, ACM 



SIGACT News 42 (2) (2011) 52-67. 



URL http : //www . cs . uwaterloo . ca/~watrous/papers . html 



[11] E. Bernstein, U. Vazirani, Quantum complexity theory, in: STOC '93: Proceedings of 
the twenty-fifth annual ACM symposium on Theory of computing, ACM, New York, 
NY, USA, 1993, pp. 11-20. 

[12] P. W. Shor, Polynomial-time algorithms for prime factorization and discrete log- 
arithms on a quantum computer, SIAM J. Sci. Statist. Comput. 26 (1997) 1484. 
|arXiv: quant-ph/9508027} 

[13] J. Preskill, Quantum computing and the entanglement frontier (Mar. 2012). 
larXiv: 1203 .58131 

[14] M. Feldmann, Classical description of quantum randomness using stochastic gauge 



systems (2009). arXiv : 0911 . 1525 [quant-ph] 



[15] E. T. Jaynes, Probability Theory: The Logic of Science, Cambridge University Press, 
Cambridge, UK, 2003. 

[16] G. Polya, Les mathematiques et le raisonnement plausible, Gauthier-Villard, Gabay, 
Paris, 1958. 



12 



[17 
[18 
[19 

[20 
[21 
[22 
[23 
[24 
[25 
[26 

[27 

[28 
[29 

[30 

[31 

[32 
[33 



R. T. Cox, Probability, frequency, and reasonable expectation, American Journal of 
Physics 14 (1946) 1-13. 

C. E. Shannon, A mathematical theory of communication, Bell System Technical 
Journal 27 (1948) 379-423 , 656-715. 

H. J. von Neumann, A certain zero-sum two-person game equivalent to the optimal 
assignment problem, in: H. W. Kuhn, A. W. Tucker (Eds.), Contributions to the 
Theory of Games, vol. II, Ann. of Math. Studies, no. 28, Princeton, 1953, pp. 5-12. 

M. O. Rabin, Degree of difficulty of computing a function and a partial ordering of 
recursive sets, Tech. rep., O.N.R. No 1 Hebrew University, Jerusalem, Israel (1960). 

A. Cobham, The intrinsic computational difficulty of functions, in: Proc. Logic, 
Methodology, and Philosophy of Science II, North Holland, 1965, pp. 24-30. 

J. Hartmanis, R. E. Stearns, On the computational complexity of algorithms, Trans. 
AMS 117 (1965) 285-306. 

M. Blum, A machine-independent theory of the complexity of recursive functions, J. 
ACM 14 (2) (1967) 322-336. 

S. Cook, |The P versus NP problem| (2000). 

URL http : //www . claymath . org/millennium/P_vs_NP/pvsnp . pdf 

L. Khachiyan, A polynomial time algorithm for linear programming, Doklady Akod. 
Nouk SSSR 244 (5) (1979) 1093-1096. 

G. B. Dantzig, Maximization of a linear function of variables subject to linear in- 
equalities, in: T. Koopmans (Ed.), Activity Analysis of Production and Allocation, 
Wiley & Chapman-Hall, New York, London, 1951, pp. 339-347. 

S. Cook, The complexity of theorem proving procedures, in: Proceedings of the Third 
Annual ACM Symposium on Theory of Computing, ACM, New York, NY, USA, 1971, 
pp. 151-158. 



L. A. Levin, Universal search problems, Peredaci lnformocii 9 (1973) 115-116. 
URL http : //www . cs . bu . edu/f ac/lnd/research/publ . htm#dan73b 

R. M. Karp, Reducibility among combinatorial problems, in: R. E. Miller, J. W. 
Thatcher (Eds.), Complexity of Computer Computations, The IBM Research Sym- 
posia Series, Plenum Press, New York, NY, USA, 1972. 

R. W. Yeung, Information Theory and network coding, Springer- Verlag, New York, 
2008. 

T. H. Cormen, C. E. Leiserton, R. L. Rivest, C. Stein, Introduction to Algorithms, 
MIT Press, Cambridge,Ma., USA, 2009. 

K. G. Murty, Linear Programming, John Wiley & Sons, New York, 1983. 

C. M. Caves, C. A. Fuchs, R. Schack, Quantum probabilities as Bayesian probabilities, 



Phys. Rev. A 65 (2) (2002) 22305. arXiv:quant-ph/0106133 



[34] Clay Mathematics Institute, [link] 



URL http : //www . claymath . org/millennium/ 



13 



